PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID CA04g03990
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Capsiceae; Capsicum
Family HD-ZIP
Protein Properties Length: 729aa    MW: 80142.8 Da    PI: 5.6189
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
CA04g03990genomePEPView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox505.2e-1662117156
                 TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
    Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                 +++ +++t+ q++e+e++   +++p+ ++r+eL kklgL+  qVk+WFqN+R+++k
  CA04g03990  62 KKRYHRHTQIQIQEMESFNFHCPHPDDKQRKELGKKLGLEPLQVKFWFQNKRTQMK 117
                 688899*********987777********************************998 PP

2START208.62.4e-652484681206
                 HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEEC CS
       START   1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetlevis 86 
                 ela +a++el+++a+ +ep+W k      e + ++e+ ++f+++ +      ++ea+r+s+vv+m++ +lve+l+d++ qW+  +a    +  tlev+s
  CA04g03990 248 ELAVAAMEELIRMAQTGEPLWIKTLdnssETLSEEEYFRTFPQGIGpkplgLTSEASRESAVVIMNHINLVEILMDVN-QWTSVFAglvsRSLTLEVLS 345
                 57899*******************999999999**********999********************************.******************** PP

                 TT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXXH CS
       START  87 sg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkgrlph 178
                 +g      galq+m+ae+q++splvp R+ +fvRy++ + +g+w++vdvS+d+ ++      v R +++pSg+li++++ng+skvtw+ehv++++r+ h
  CA04g03990 346 TGvagnynGALQVMTAEFQVPSPLVPtRENYFVRYCKHHADGTWAVVDVSLDHLRPTA----VSRDRRRPSGCLIQELPNGYSKVTWIEHVEVDDRSAH 440
                 *******************************************************976....79999******************************** PP

                 HHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
       START 179 wllrslvksglaegaktwvatlqrqcek 206
                 +++r+lv+sgla+gak+wvatl+rqce+
  CA04g03990 441 NIYRPLVNSGLAFGAKRWVATLDRQCER 468
                 **************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.603.0E-2044117IPR009057Homeodomain-like
SuperFamilySSF466895.01E-1655124IPR009057Homeodomain-like
PROSITE profilePS5007114.33359119IPR001356Homeobox domain
SMARTSM003891.5E-1360123IPR001356Homeobox domain
CDDcd000861.59E-1462120No hitNo description
PfamPF000461.2E-1362117IPR001356Homeobox domain
PROSITE profilePS5084846.24239471IPR002913START domain
SuperFamilySSF559612.2E-36240470No hitNo description
CDDcd088755.90E-126243467No hitNo description
SMARTSM002345.7E-68248468IPR002913START domain
PfamPF018526.0E-55249468IPR002913START domain
Gene3DG3DSA:3.30.530.202.5E-6345468IPR023393START-like domain
SuperFamilySSF559613.37E-27488716No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0009845Biological Processseed germination
GO:0009913Biological Processepidermal cell differentiation
GO:0048497Biological Processmaintenance of floral organ identity
GO:0048825Biological Processcotyledon development
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
GO:0008289Molecular Functionlipid binding
Sequence ? help Back to Top
Protein Sequence    Length: 729 aa     Download sequence    Send to blast
MFQPNMFESH HHLLDMSHNK SPENDLDLIR DDEFESKSMA DIMENNPCGD DNEDPNQRPN  60
KKKRYHRHTQ IQIQEMESFN FHCPHPDDKQ RKELGKKLGL EPLQVKFWFQ NKRTQMKAQH  120
ERHENSELRA ENEKLRAENI RYKEALGNAS CPNCGGPASI GEMSFDEQHL RIENARLREE  180
IDRISGIAAK YVGKPMLTYP NLSTTGPLDL GVGNFGPQTG LVGEIYNASD LLRSVSGPID  240
ADKPIIIELA VAAMEELIRM AQTGEPLWIK TLDNSSETLS EEEYFRTFPQ GIGPKPLGLT  300
SEASRESAVV IMNHINLVEI LMDVNQWTSV FAGLVSRSLT LEVLSTGVAG NYNGALQVMT  360
AEFQVPSPLV PTRENYFVRY CKHHADGTWA VVDVSLDHLR PTAVSRDRRR PSGCLIQELP  420
NGYSKVTWIE HVEVDDRSAH NIYRPLVNSG LAFGAKRWVA TLDRQCERLA SAMAINIPTG  480
EVGVITSPDG RKSMLKLAER MVMSFCAGVG ASTAHTWTTL SGSGADDVRV MTRKSIDDPG  540
RPPGIVLSAA TSFWLPVSPK RVFDFLRDEN SRNEWDILSN GGLVQEMAHI ANGRDPGNCV  600
SLLRVNSGNS SQSNMLILQE SSTDSTGSYV IYAPVDIVAM NVVLSGGDPD YVALLPSGFA  660
ILPDGGGGIN TTCTSGSLLT VAFQILVDSI PTAKLSLGSV ATVNSLLKCT VERIKNALAC  720
DTMPDGKI*
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_016570434.10.0PREDICTED: homeobox-leucine zipper protein MERISTEM L1-like
SwissprotQ93V990.0PDF2_ARATH; Homeobox-leucine zipper protein PROTODERMAL FACTOR 2
TrEMBLA0A0V0IVG00.0A0A0V0IVG0_SOLCH; Putative homeobox-leucine zipper protein MERISTEM L1-like
STRINGPGSC0003DMT4000618590.0(Solanum tuberosum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA9322491
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G04890.10.0protodermal factor 2